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In an effort to validate dynamic assessment methods 
influenced by Vygotsky' s (1978) definition of zones of proximal 
development (an indicator of readiness), three sets of experiments 
addressed two goals: the develoi^ent of diagnostic assessment methods 
and the use of diagnostic results to guide the design of 
instructional programs. The first two studies examined two 
psychological processes presumed to be significant predictors of 
academic performance: acquiring new information (learning) and using 
that information in novel situations (transfer). Results indicated 
the validity of the methods used, showing ^1) that groups o€ children 
of contrasting abilities do differ in terms of learning and transfer, 
with differences being greater in transfer; and (2) that learning and 
transfer are better prec^ictors of performance than either IQ or 
standardized test scores, with the best predictor being transfer. In 
the third study, which tested the instructional application of 
dynamic assessment results (emphasizing enhancing the transfer 
process), subjects taught to use four reading strategies demonstrated 
gains in therr ability to transfer these skills, as evidenced by 
their standardized and reading comprehension test scores. Results of 
these studies suggest that the best predictors of the extent to which 
individuals are likely to profit from instruction are their initial 
response to instruction (learning) and, even more, how successfully 
they can transfer their learning to novel situations. Results also 
suggest the superiority of dynamic assessment over static testing for 
planning for instruction. A seven-page list of references is 
inclut'ed. (LLZ) 
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Dynamic Assessment: 
One Approach and Some Initial Data 
Proponents of dynamic assessment methods are concerned with 
identifying students who are likely to experience academic 
problems and with providing descriptions of those students* 
strengths and weaknesses in such a way that remedial programs can 
be developed. A major stimalus for the interest in dynamic 
assessment procedures is a dissatisfaction with certain features 
of standardized "static" tests. In these static tests, children 
are asked for specific information or are required to solve 
certain types of problems. The tester provides no help during 
the testing session. The score individuals attain represents an 
estimate of their current, unaided level of conpetence. All too 
often, the unwarranted inference is made that these scores are a 
measure of ability level, i.e., an IQ score of 70 is seen as 
relatively permanent and resistant to change. In many cases, 
particularly when children from culturally different backgrounds 
are involved, this picture may provide a dramatic underestimate 
of their potential level of performance under more favorable 
circumstances . 

Dynamic assessment methods aim to go beyond this state of 
affairs by assessing the operation of basic psychological 
processes presumed responsible for acquisition of the information 
requ»;sted on standard tests. Some children may not have acquirud 
the Information or skills being assessed, but nonetheless may be 
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able to do so quite readily if given the opportunity. The future 
academic performance of children in this category would be 
expected to be better than one would expect on the basis of their 
initial, unaided static test performance. To generate this 
additional diagnostic information, developers of dynamic 
assessment methods ha^e used a number of different techniques, 
all of which xnvolve the provision of some form of help to the 
chile'. This aid can take the form of modifying the format in 
which the test is administered (e.g., Carlson & Wiedl, 1978, 
1979), providing direct instruction in metiiods of solving the 
problems (e.g., Budoff, 1974), or at Tempting to evaluate directly 
a set of target processes (e.g., Feuerstein, 1979). The 
assumption is that performance estimates obtained under these 
altered conditions will provide more accurate assessments of 
individual differences than standard test scores, or will at 
least supplement the picture they paint. 

Al ♦■hough sharing common assumptions, the methods that have 
been advarced differ in a number of ways, including the g^al of 
the program. Some aim to engineer maximal levels of performance; 
ethers seek to measure the magnitude of response to instruction; 
still others focus on the efficiency of operation of specific 
cognitive processes. Different program goals have resulted in 
different methods of conducting the assessment. In this chapter, 
we outline our own variations on the theme of dynamic assessment. 
The approach we have adopted has been influenced by two lines of 
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research, one specifying the format of the assessment itself and 
the second identifying the target processes we seek to evaluate. 
A General Framework 

Our approach to both assessment and instruction has been 
heavily influenced by Vygotsky (1978) and neo-Vygotskians 
currently working in the Soviet Union on the development of 
assessment techniques for recognizing academic delay (Vlasova & 
Pevzner, 1971; Zabramna, 1971). Both the Soviet investigators 
and our team have been influenced by Vygotsky *s general view of 
learning and development and his notion of a zone of proximal 
development. We emphasize, however, that the resultant approach 
is an amalgam of our views on cognition and instruction and 
Vygotsky's theory; and it is in no way meant to represent 
Vygotsky »s original views unchanged (Brown & French, 1979; see 
Mirsch chapter). Vygotsky emphasized that much of learning was 
mediated through social interactions. Children experience 
cognitive activities in social situations and come to internalize 
them gradually over time. At the outset, the child and an adult 
work on together, with the adult doing most of the work and 
serving as an expert model. As the child acquires some degree of 
skill, the adult cedes the child responsibility for part of the 
job and does less of the work. Gradually, the child takes more 
of the initiative, and the adult serves primarily to provide 
support and help when the child experiences problems. 
Eventually, the child internalizes the initially joint activities 
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and becomes capable of carrying them out independently. At the 
outset, the adult is the model, critic, and interrogator, leading 
the child toward expertise; at the end, the child adopts these 
self-regulation and self-interrogation roles. It is this gradual 
transfer of control that we seek to capture in ou assessment and 
instructional sessions. 

Within this context, Vygotsky also described the zone of 
proximal development, which refers to tha distance between the 
level of performance a child can reach unaided and the level of 
participation that can be accomplished when guided by a more 
knowledgeable participant. For a certain child, in a particular 
domain, this zone may be quite small, the interpretation being 
that the child is not yet ready to participate at a more mature 
level than his unaided performance would indicate. For another 
child in that domain, or that child in another domain, the zone 
of proximal development can be quite large, indicating that with 
aid, sometimes minimal aid at that, the child can participate 
much more fiilly and maturely in the activity than one might 
suppose on the basis of only unaided performance. 

The assessment process suggested by Vygotsky has been quite 
influential in the diagnostic testing of problem learners in the 
Soviet Union (Brown & Ferrara, 1985; Brown & French, 1979; 
Campione & Brown, 1984; Wozniak, 1975). This process involves an 
initial assessment of competence, followed by instruction on the 
target task(s). Children with high degrees of readiness (broad 
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zones of proximal development) within 'chat domain should benefit 
considerably from the intervention, whereas those with less 
readiness will not perform much better with this help than they 
did prior to it. As with other aporoaches, this measure of gain 
is presumed to possess greater predictive utility tl.an the 
initial, unaided level of performance. 

This framework has guided our work in both assessment and 
instruction. In this chapter, we will describe three sets of 
experiments that form parr of an overall program of research with 
two major goals: (a) the development of diagnostic methods of 
assessing individual differences in students' readiness to 
perform in traditional academic domains; and (b) the use of the 
resulting information to guidn the design of instructional 
programs that enhance the academic performance of students 
exhibiting relatively poor performance. In addition to the 
Vygotskian influence, they all involve aspects of dynamic 
assessment. Despite these similarities, the series also differ 
from each other in important ways. The differences arise because 
the studies are addressed to different issues within the present 
enterprise, including some that are primarily of theoretical 
interest and others that involve both theoretical and practical 
issues. Before proceeding to the specific studies, we will 
review some of the considerations that influenced our specific 
choices. 
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Background 

To put the overall research effort into context, we will 
describe the issues that have attracted our interest over time 
and the considerations that led us to this particular approach 
We have long been concerned with the diagnosis and remediation of 
weak students' academic problems. To do this, we need: {a) to 
identify the students likaly to experience difficulties; (b) to 
analyze the academic domain in question in terms of a theoretical 
specification of the skills underlying successful performance; 
(c) to apply methods of assessing the individual's competence 
with those skills; and (d) to implement instructional methods for 
overcoming whatever deficiencies may be revealed through the 
assessment process. 

As with many others, we have been less than optimistic about 
the role standard ability tests can play in this overall 
endeavor. In the next sections, we outline several reasons for 
this concern. The first involves a contrast between two 
different kinds of diagnostic procedures. This is followed by a 
more detailed analysis of the structure of standard tests. 
Having then described our reasons for adopting dynamic assessment 
methods, we distinguish two distinct uses we have made of the terc 
Forms of Diagnosis 

With regard to diagnosis, there are two levels at wh^ch the 
enterprise can be evaluated, one mainly aimed at identification 
and the second more concerned with prescription . In the former 
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case, we might be concerned with identifying the students who are 
likely to experience difficulties, thus indicating the n--d for 
particular attention, A more valuable diagnosis would also be 
prescriptive; it would specify in detail the reasons for the 
problem, thus indicating both the need for, and direction of, 
remedial atteiapts. Although both identification and prescription 
are valuable, prescription enables us to work toward the second, 
instructional goal. 

This distinction highlights the strengths and weaknesses of 
stanHard intelligence and ability tests. Under sol - 
circumstances, they do provide information that contributes to 
the identification goal, i.e., they can indicate students who are 
likely to experience problems; however, even this success has its 
limitations. Of more importance, standard tests have been much 
less successful at meeting the prescriptive goal. In the next 
section, we review some hypotheses about the sources of the 
specific limitations of those tests. 
L imitations of Standard Test Procedures 

One immediate question which arises is why there is a need 
for dynamic assessment approaches. Our goal is to link diagnosis 
and remediation; however, that goal is by no means novel. 
Standardized intelligence and ability tests were intended to 
identify individuals with acadeiric problems and many were 
designed to provide "profiles" of ability that should allow a 
somewhat detailed analysis of the strengths and weak lesses of 
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individuals. From such a picture, it should be possible to 
prescribe interventions tailored to the needs of particular 
students or groups of students. Such approaches, however, have 
not yielded much in the way of encouraging results (Brown & 
Campione, in press; Mann, 1979), and there are several reasons 
why this might be ^he case: 

Product-based m ature . Standard tests analyze the student's 
current level of performance but provide no direct evidence 
regarding the processes that nay have operated or failed to 
operate to bring about that performance. As such, they provide 
at best a partial vi^w of the testee's status. A nice statement 
of this point was made by Vygotsky, who n<5ted that static last 
scores do not provide any information about 

those functions that have not yet matured but are in 
the process of maturation, functions that will mature 
tomorrow but are in the embryonic stage. These 
fuactions could be called the 'buds' or 'flowers', 
rather than the fruits of development. The actual 
developmental level characterizes mental development 
retrospectively, while the zone of proximal 
development characterizes mental development 
prospectively. (Vygocsky, 1978, pp. 86-87) 
It is not that developers of SLandard tests are unconcerned with 
process — they do interpret the results in terms of sets of 
processes — bu. rather that their approach is to infer the 
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processes underlying test performance from analyses of the 
structure of the test results themselves. Given their nature, 
standard tests rest heavi.^ y on the assumption that all testees 
have had comparable backgrounds and opportunities to acquire the 
information requested. This assumption is particularly suspect 
for students from minority or disadvantaged backgrounds. With 
such poDulations, abilities are quite likely to be 
underestimated. The result is that the identification goal of 
the evaluation is jeopardized. 

Level of description . If we turn to the prescriptive 
aspects of assessment, there further problems with the process 
analyses involved in traditional tests. The "profiles" that 
result from such tests and that are used ac the basis for 
description and diagnosis are couched in terms of very global 
aspects of performance (eg., auditory sequencing) that are not 
easily theoretically relatable to interesting academic areas and 
tasks. ?nch diagnoses at best rest on somewhat vague 
abstractions from a particular psychological theory and cannot 
provide the kind of specific information needed to design 
instructional programs. For example, if auditory sequencing were 
diagnosed as the problem, it is not clear how best to intervene. 
Even if such skills can be developed, it is then left to the 
student to determine how and when these skills are to be used in 
academic contexts. 
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Degree of generality assumed . Finally, there is a related 
p.oblem. The profiles that emerge are based on assumptions about 
the generality of the factors inferred from such tests. The 
abilities are presumed to be extremely general ones thac operate 
in many, if not all, academic domains. While domain-general 
skills may well exist, it is also abundantly clear that there are 
important domain-specific capabilities that underlie successful 
performance in different domains, e.g., mathematics or reading. 
The tests available generally do not tap these skills in any 
meaningful way. i-^.iile perhaps obvious, it seems reasonable to 
argue that if one is interested in assessing skill in the area of 
math, the assessment should be situated in the context of math 
problems. Again, the case of such processes as auditory 
sequen'-ing is illuminating. The potential relevance of these 
processes to intervention programs rests on very strong 
theoretical assumptions about the nature of academic 
intelligence. The factors are presumed to be quite general, with 
the result that they affect performance in many situations. 
Improving auditory sequencing, then, would be expected to have 
widespread effects throughout the system. The analogy is with a 
muscle system in which practice on different skills strengthens 
the overall system and thus affords generalized improvement in 
performance. 

Static nature of evaluation ^ Although not a necessary 
feature of standard tests, nonetheless the result of ass^^ssment 
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is frequently taken as providing a relatively permanent 
characterization of the individual in question. The 
classifications that result, already presume^ i to reflect 
"general" academic ability, further tend to be regarded as fixed 
and unlikely to change over long --"^iods of time, A measured IQ 
of 70, for example, is frequently assumed to reflect a relatively 
permanent characteristic of the student in all situations and 
under all circumstances. 
Interpretations of Dynam-^c Assessment 

As this volume is concerned priiuarily with issues regarding 
dynamic assessment, we feel it useful at this point to contrast 
two different ways in which we have used the term. The question 
is, what is dynamic about dynamic assessment? Although in both 
cases the important distinction is between static ana dynamic 
properties of the assessment process, they differ in what, within 
the procedure, is regarded as dynamic—that being assessed, or 
the assessment itself. In the moia traditional usage, the one we 
have already described, the interest is in assessing the 
efficiency of operation of the psychological p rocesse s involved 
in growth and change. The interest is not so much in evaluating 
an individual's current state of knowledge or skill as in 
estimating his or her readiness for change. The contrast is 
clearly with standard test procedures in which descriptions of 
individuals are couched in terms of what they currently know 
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about some domain, or alternatively stated, between product- and 
process-based assessments of Individual differences. 

In the second case, we emphasize the dynamic nature of 
assessment Itself — the notion that any assessment needs to be 
continuously re-evaluated as the student begins to acquire skill 
within some domain. Again, this Is a feature of Vygotsky's 
treatment of the zone of proximal development. His argument Is 
that Instruction creates this zone; hence, with Instruction, an 
Individual's zone of proximal development changes, and It becomes 
necessary to continually update the diagnosis If Instruction is 
to he appropriately directed. The assessment of an Individual's 
zone, or readiness. Is assumed meaningful for only brief periods, 
as one's readiness can Itself change with practice and/or 
Instruction. In this vein, we have also atcempted to construct 
situations in which the assessment Itself is dynamic rather than 
static, cases where the evaluator continually refines the 
dlagno5;ls of the learners as they acquire competence. Thus, we 
use the phrase dynamic assessment to refer to: (a) assessment of 
process, or of the dynamics of change; and (b) to the need to 
continually change and refine the diagnosis of the individual 
learner, i.e., the dynamic, constantly changing nature of 
assessment itself. 
An Alternative Approach 

Our approach to dealing with the limitations of standard 
test merhods Involves several features. One is that assessment 
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should evaluate as directly as possible the particular processes 
underlying successful performance. The second is that the 
assessment should ideally be situated with in £ specific domain, 
rather than being aimed at "general intellectual functioning." 
This in turn increases the likelihood that the processes can be 
specified in sufficient detail that instructional prescriptions 
can be designed. Finally, we make explicit the assumption that 
any diagnosis may have a very short half -life, and thac re- 
diagnosis^ must be an integral part of any resultant intervention. 
Having decided to concentrate as directly as possible on 
ocess is only a first step—it is still necessary to specify 
the process(es) to be evaluated, and then to determine how to 
situate that assessment. 

Target processes . In our work thus far, we have 
concentrated on the role of quite global learning and transfer 
processes; the long-term goal is to be much more specific about 
the factors underlying individual difierences in learning and 
transfer. In initial studies, we looked at the extent to which 
these global processes were related to overall academic ability. 
In more recent studies, we have concentrated on learning and 
transfer processes assessed within specific domains. In effect, 
we have assumed that estimates of individuals' learning potential 
and transfer efficiency within some domain provide measures of 
their readiness to perform in that domain. 
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This view emerged from a long series of studies with 
scholastic lily weak students, frequently labeled as learning 
disabled or mildly retarded. In that work, we concluded that in 
a variety of problem-solving situations, those students had 
difficulty learning new information (required complete and 
detailed instruction to do so) and were relatively unlikely to 
use that information flexibly in new problem situations (Brown, 
1974, 1978; Brown & Campione, 1978, 1981, 1984; Campione & Brown, 
1977, 1978, 1984; Campione, Brown & Ferrara, 1982), 

Methods of assessment . Although that seems reasonable 
enough, at the time we began this program, the bulk of the 
available evidence dia not support the position that assessments 
of learning ability or transfer flexibility would provide much 
helpful information about individual students (see Campione et 
al., 1982; Campione, Brown & Bryant, 1985). The question is how 
one might reconcile the disparate sets of findings. We have 
outlined our hypotheses in other sources (e.g., Campione & Brown, 
1984), and will summarize them here. The major argument is that, 
in the studies generating negative findings, the estimates of 
learning and transfer efficiency were obtained: (a) in asocial 
learning situations; (b) involving only minimal feedback from the 
evaluator, most freq uently simple feedback about the correctness 
of individual responses; and (c) situated in arbitrary domains. 
Tl»e netrics of learning and transfer were the amount of time 
aiid/or the number of trials needed to bring about learning. 
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As an example of this research approach, consider some 
comparative studies reported by Woodrow comparing the learning 
(Woodrow, 1917a) and transfer (Woodrow, 1917b) performance of 
groups of retarded and nonretarded children with mental ages of 
around ten years. The learning tasks he used involved a 
^geometrical form sorting task in which the children were required 
to sort five forms into different boxes. They sorted 500 of 
these a day for 13 days, guided at best by feedback about the 
correctness of their individual placements. The main index of 
learning was the increase over time in the number of forms 
sorted. The transfer tasks consisted of two new sorting tasks 
(lengths of sticks and colored pegs) and two cancellation tasks 
(letters and geometric forms). Using these tasks, Woodrow found 
no differences whatsoever between the retarded and nonretarded 
groups in either learning or transfer performance. In these 
studies, learning and transfer were seen as passive, asocial, 
extremely general processes that could be tapped in any task 
domain. These conditions were typical of many studies failing to 
find evidence that learning and/or transfer processes represented 
important dimensions of individual differences (see Woodrow, 
1946, for a review). 

In contrast, the more recent studies, those yielding 
positive results, are characterized by a concern for structured 
intervention, often involving complex social interaction. The 
problems to be learned are set in non-arbitrary domains, i.e., 
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ones where there are rules for the students to learn and where jt 
is possible to come to understand why certain responses are 
appropriate in given situations and not in others. This 
understanding then erves as the basis for subsequent use of the 
newly acquired i formation, i.e., principled transl..^. is possible 
(Brown & French, 1979; Campione & Brown, 1984). The metric of 
learning or transfer efficiency is the amount of help needed for 
a student to acquire a rule or procedure. 

Given this analysis, we assumed that if we wished to assess 
individual differences in learning and transfer that would be of 
diagnostic significance, we would have to match these latter 
conditions. The learning should be guided by the adult tester 
and shcald involve the acquisition of rules or principled whose 
application in novel contexts we could subsequently observe. 
These ideas clearly meshed nicely with those of Vygotsky (1978), 
and our procedures have ended up being quite similar to those 
employed by neo-Vygotskians in the Soviet Union. 

The studies we have conducted follow the same general 
format. They begin with an evaluation of children's initial 
competence. Following this, they are placed in a mini-learning 
environment where an adalt (or a computer) works collaboratively 
with them until they are able to solve sets of problems 
independently. If they are unable to solve a particular problem, 
they are given a series of hints to help them. The initial hints 
are very general ones, and succeeding ones become progressively 
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more specific and more concrete, with the last "hint" actually 
providing a detailed blueprint for generating the correct answer. 
This titration procedure allows us to estimate che minimum amount 
of help needed by a given child to solve eaC problem. The 
metric of learning efficiency is the number of hints required for 
the attainment of the learning criterion (typically two 
successive problems solved with no help). Note that the metric 
here differs from that used by several others interested in 
dynamic assessment, including Vygotsky, in that it is not how 
P^^ch improvement one can bring about, through intervention, but 
rather how much aid is needed to bring about a specified amount 
of learning. 

Exactly the same hinting procedure is used on the transfer 
problems, generating the analogous metric. Note that the index 
of transfer propensity is thus a dynamic, rather than static, one 
(Brown, Bransford, Ferrara & Campione, 1983). That is, we do not 
measure how many and what types of transfer items individuals can 
solve on an unaided test (a static measure), rather, we are 
concerned with how facile they are in coming to deal wi.'i related 
portions of the overall problem space (a dynamic measure) — 
specifically, how many hints they require to solve the various 
types of transfer problems. Following these instructional 
sessions, a post-test is given, and the gain brought about by the 
instruction determined. 
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To summarize, we decided to situate our assessment of 
learning and transfer efficiency in social interactional contexts 
in which the evaluator would be engaged in the task of teaching 
the children how to solve sets of problems; the measures of 
learning and transfer could then be based on students' responses 
to that instruction. 

There is one further point to emphasize. The hints employed 
were based on a detailed tcsk analysis and were designed such 
that each one would provide more specific information than the 
previous one(s). These hints were given ?.n a fixed sequence and 
vere, with one exception, independent of the individual child's 
responses (the exception was that if the child had already 
generated the information provided by an early hint, that hint 
was omitted, and the experimenter gave the next hint in the 
sequence). The procedure was then task -, rather than child- 
oriented. This was done because we aimed to produce qt.antitative 
data with good psychometric properties; the amount of help 
indices are likely to have such properties only if the test 
administration is standardized as much as possible. 

The trade-off is with more clinical procedures in which 
assessors vary their questions, or promptr, , with different 
children as those children show different approaches to the 
problems at hand. Such approaches may well provide richer 
information about the skills and aptitudes of individual 
children; however, they are less likely to produce strong 
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quantitative data. As we will describe later, we have attempted 
in some of our more recent scudies to modify our procedures in 
ways that allow us to combine the strengths of the different 
approaches • 

Task domains . The next decision involved the selection of a 
domain in which to embed the teaching. Given an interest in 
transfer propensity, it is necessary to choose a domain in which 
rules or principles can be learned and applied to novel types of 
problems. As we were also interested in academic skills, we also 
wished to choose a domain that was known to be related to school 
performance. In our initial studies investigating the diagnostic 
urtlity of measures of learning and transfer, we worked with 
inductive reasoning problem spaces, variants of progressive 
matrices problems and series completion problems, as performance 
on those tasks is known to be related to scholastic success. 
Further, enough was known about the structure jf those tasks that 
it was possible to design a theoretically-based teaching, or 
hinting, sequence. In our work on instruction emphasizing the 
dynamic nature of the assessment process itself, we concentrated 
on studies of reading and listening comprehension, skills of 
considerable academic importance. 

Specific Studies 
We will summarize the results of three sets of studies. The 
first two involve the theoretical and diagnostic status of the 
learning and transfer measures obtained in our adaptation of the 
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zone of proximal development testing procedures. Iheiie d 
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issues of concurrent validity and predictive validi ty. In the 
first case, we selected students of varying academic ability and 
assessed their performance as they learned how to solve inductive 
reasoning problems. Performance on these problems, featured on 
most ability tests, consistently distinguishes academically 
successful from less successful students. Our expectation, then, 
was that learning and transfer indices, obtained in these 
fiomains, would be related to assessed ability. 

In the case of predictive validity, we wislied to go one step 
farther and e*-luate the extent to which the dynamic measures 
would provide diagnostic information beyond that afforded by 
static ability tests. While ability test performance was 
expected to be related to learning/transfer efficiency in these 
inducti/e reasoning domains, we also expected that the dynamic 
measures would provide more information about the future 
performance of subjects within those domains than would the 
static tests. 

In these studies, instruction is provided, and response to 
instruction is used as a metric of individual students' readiness 
to deal with the domain under study. The concern is with how 
much instruction is needed to bring about a given level of 
performance . 

The third serier, is more "purely" instructional. The goal 
here is to maximize the performance of individual students in 
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important academic domains. One key element of the instructional 
program is the need for continually updating the diagnosis of 
students' current skixl levels, rather than using the initial 
estimate as a long-term index. 
Studies of Concurrent Validity 

Ir these studies, we were interested in the extent to which 
measures of learning and transfer efticiency, obtained within the 
context of prototypic ability test items, specifically inductive 
reasoning tasks, would be related to general ability levels. 
There are two issues involved: (a) regarding diagnosis, do 
either or both measures distinguish lower ability students from 
those of higher ability—an identification issue? and (b) 
regarding theory, can part of the variance in individual 
differences in this domain be attributed to learning and/or 
transfer dynamics — a qualitative issue. 

Campione , Brown, F errara, Jones , and Steinberg (in press) . 
In this study, we used a variant of the Raven Progressive 
Matrices task. At the outset, subjects were given a pre-test 
involving the kinds of problems that were to be used in the 
diagnostic/instructional sessions. Each problem involved a 3 x 3 
matrix wil the lower-right entry left blank; the subject's task 
was to select, from a set of six, the pattern that best completed 
the matrix. The subjects consisted of groups of retarded (mean 
IQ = 72) and nor.retarded (mean IQ = 118) children matched for a 
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mental age of approximately years and lor performance on the 




pre-tes t • 




During the instructional sessions, students worked at a 




computer terminal. In the initial phase, they learned to solve 




problems involving taree rules: rotation, imposition, and 




subtraction. ExampJes are shown in Figure i. During this, the 




learning portion of the study, the problems were presented in a 




Insert Figure 1 about here 




blocked form it. Each student learned the rotation problems 




to a criterion, t^en the imposition problems, and finally the 




subtraction problems (an easy-to-hard sequence). In the next 




(maintenance) session, novel exemplars of the same cype were 




presented, but now in a random order. The ensuing (transfer) 




session included these same problem types interspersed with a set 




of transfer problems; these required the use of combinations of 




the original rules (rotation + imposition; rotation + 




subtraction^. Examples are shown in Figure 2. 




Insert Figure 2 about here 




In contrast to the pre-test procedure (and the standard 




procedure used with the Raven), their task here was to generate 




the pattern needed to complete the matrix by issuing a set of 
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prc-taught commands using a touch-sersitive screen. Graduated 
and animated hints were provided via the computer as needed, with 
on adult reading the hints to the child if necessary and 
providing general encouragement. The hints were presented in a 
pre-set sequence, proceeding from very j^eneral hints offering 
relatively little specific information about the form of the 
solution to very specific hints, which eventually provided a 
detailed blueprint from which the child could generate the 
correct answer. The numbers of hints taken to reach the learning 
criterion and to solve the maintenance and transfer problems were 
the metrics of learning and transfer efficiency. A sample hint 
sequerc^ (for Rotation problems) is shown in Figure 3. 



Insert Figure 3 about here 



No differences were obtained during the learning phase of 
the study, possibly due to the matching procedures that equated 
the groups for both mental age and entering competence. However, 
group differences were apparent during both the maintenance and 
transfer phases of the study. Further, those differences tended 
to increase as the similarity of the training and test contexts 
decreased. The greater the need for flexibility in applying the 
learned rules, the larger were the diflei nces between retarded 
and nonretarded children. Thus, in this study, transfer, but not 
learning, performance did distinguish the different ability gioups. 
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Ferrara, Brown and Camplone (In press) > This study included 
a different ability range and a developmental factor, contrasting 
thiij and fifth grade children of average and above average 
abiJ-fty. A second inductive reasoning task, letter series 
completions, was used. One major difference between the series 
completion and matrices tasks was that in the series completion 
case, a more detailed examination of "transfer distance" was 
included. Tae idea, supported by the results of the Campione et 
al. study, was that individual or group differences would be more 
apparent as the transfer distance, or the difference between the 
learning and transfer situations, increased. While there have 
been some suggestions that, for example, retarded children may 
show "near transfer," they are quite unlikely to show "far 
transfer" (e.g.. Brown, 1978; Campione & Brown, 1977). A nice 
general statement of this notion was provided by Gagne (1970) in 
the course of describing lateral transfer: 

In the case of this kind of transfer, the question of 
how much appears to be a matter of how broadly the 
individual can generalize what he has learned to a 
new situation. Presumably, there are limits to the 
breadth of generalization, which vary with different 
individuals. One could perhaps think of a whole 
range of situations of potential applicability of 
(some learned rules) that display decreasing degrees 
of similarity to the situation in which the rule had 
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originally been learned* At some point along this 
dimension of breadth of generalization, a given 
individual will fail to transfer his previously 
learned knowledge • Another individual, however, may 
be able to exhibit transfer more broadly to a wider 
variety of differing situations, (p, 336) 
While the idea is an attractive one, there are few relevant data 
available, one problem being that there has frequently been no 
objective way of determining transfer distance in the domains 
that have been investigated. The series completion task was one 
that lent itself nicely to this task* Specifically, "transfer 
distance" can be defined in terms of the number of 
transformations distinguishing the learning problems from the 
various transfer items* Examples of the learning, maintenance, 
near transter, far transfer, and very far transfer items are 
shown in Table K 



Insert Table 1 about here 



The child's task is to fill in the blanks with letters that 
continue the pattern that is determined by a certain periodicity 
and by certain alphabetic relations ( Next , the appearance of 
letters in alphabetical sequence; Identity , the repetition of 
letters; and Backward-next , the appearance of letters in reverse 
alphabetical sequence)* On the learning items, the children 
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learned to deal with the Next and Identity relations, and with 
periodicities of two and four. Maintenance items involve no 
transformations, but are simply novel exemplars of the same 
problem types learned originally. Near transfer items involve 
the same principles (r ,lations and periodicities) learned 
originally but in different combinations. Far transfer items 
involve the application of a novel periodicity (three) or 
relation (Backward-next). And Very far transfer items involve 
the use of novel principles in a novel context. 

There was an overall effect of ability during the learning 
phase; high ability children needed fewer hints to learn the 
initial problems than did the average ability group. The 
transfer data, however, were of more interest. The major 
finding, as expected, was that group differences increased as 
transfer distance increased. These results are shown in Figure 
where it can be seen that virtually no aid was required on the 
maintenance items and very little on the near transfer items; 
there were no instances of group differences. However, on the 



Insert Figu; e 4 about here 



far and very far transfer items, group differences were highly 
reliable. The results of a series of correlational analyses 
revealed the same pattern. Correlations between IQ scores and 
number of hints taken were non-significant for maintenance and 
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near transfer, but reliable when far and very far transfer 
performance was considered. 

Campione and Ferrara (in preparation) . The next study 
involved a comparison of retarded and nonretarded children on the 
series completion task. The results were quite consistent with 
those of the first two studies. Group differences emerged on 
both learning and transfer sessions, with the differences being 
larger during transfer. As in the previous studies, the 
nonretarded children performed extremely well on the maintenance 
series, requiring virtually no help to solve those problems; 
however, the retarded students did need experimenter-provided 
hints to deal with those problems. On far and very far transfer 
tests, the differences between the groups again increased 
reliably. 

Overall, the results of these three studies establish the 
concurrent validity of the learning and transfer measures. 
Groups of children of contrasting ability do differ in terms of 
their learning, and particularly transfer, performance. Less 
able children tend to need more help to solve sets of original 
learning problems, and then continue to be at a disadvantage when 
they are required to make flexible use of the rules or principles 
they have been taught. The greater the amount of flexibility 
required, the larger is the difference between the groups. 

The notion of transfer distance does appear to be an 
important one in terms of diagnosis of group differences. The 
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farther the distance, the larger the magnitude of any difference. 
Further, groups of different ability vary in how "far" they can 
transfer before they begin to run into difficulties. In both 
Studies 1 and 3, where retarded children were involved, those 
students began to require help even on the maintenance series. 
Having learned to solve particular sets of problems, they still 
run into difficulties when they are asked to solve problems of 
the exact same type later in a different context. Children of 
average and above average ability, in contrast, handle 
maintenance (and even near transfer) items extremely well^ It is 
only when far transfer problems are given that they begin to need 
significant amounts of help; and only when these far transfer 
problems appear do the average and above average ability groups 
begin to differ. 
Studies of Predictive Validity 

In the next set of studies, we (Bryant, 1982; Bryant, Brown 
& Campione, 1983): (a) attempted to extend the previous results 
to younger children, five-year-olds; and (b) addressed the issue 
of whether the learning and transfer measures do provide 
additional diagnostic information about individual subjects 
beyond their standard ability scores. The ideal way to evaluate 
predictive validity would be to have measures of initial 
competence, along with measures of general ability and learning 
and transfer scores. Then, at some later p^^int in time, we could 
re-assess the students' ability. The question would then be 
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which score(s) best predict later performance — initial 
competence, general ability, or the learning and transfer 
indices. In these studies, our general procedure was to give 
subjects a pretest, learning and transfer sessions, and then a 
final postte?t. The transfer sessions included maintenance, near 
transfer, and far transfer items, again defined in terms of the 
number of transformations distinguishing the transfer probes from 
the learning items. The pretest included both evaluations of 
general ability (subscales from the WPPSI to generate an overall 
IQ estimate and the Raven Coloured Progressive Matrices) and a 
task-specific pretest. In the latter, baseline levels of 
performance on the items to be included in the learning and 
transfer sessions were obtained. The posttest was a re- 
administration of this pretest, and our major interest was in the 
gain that resulted as a consequence of the instruction afforded 
in the learning and transfer sessions. 

Two separate studies were conducted, one involving a 
simplified version of the matrices task (Bryant, 1982) and the 
second a simplified version of the series completion task. The 
major results are shown in Tables 2 and 3, which portray a series 
of multiple regression analyses. The first thing to note is that 
there are significant relations between the ability scores and 
the learning and transfer metrics, thus replicating the results 
of the previous studies. Children of higher ability tend to 
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require fewer hints to solve the original sets of problems, and 
further require fewer hints to deal with the transfer problems. 



Insert Tables 2 and 3 about here 



Of more interest are the results of the analyses of the gain 
scores. In these analyses, the effects of the estimated IQ score 
and the Raven score were extracted first. In both studies, they 
did allow a reasonable prediction of the gain score, accounting 
for around 60% of the variance in that score. Even after the 
effects of the ability scores were extracted, however, the 
learning and transfer scores otill accounted for significant 
additional portions of the variance in gain scores; thus, taking 
the learning and transfer scores into account did provide further 
diagnostic information about individual children. In the 
matrices task (Table 2), the learning score accounted for an 
additional 22% of the variance, the transfer score for a still 
further 17%. In the series completion task, the learning score 
did not result in an increase in predictability of gain scores, 
but the transfer score did account for an additional 22% of the 
gain variance beyond the ability scores. Alternatively, if one 
looks at the simple correlauions , the learning and transfer 
scores are better predictors of gain score than either of the 
static ability measures. Finally, within the set of dynamic 
measures, the tendency is for the transfer measures to be more 



ERIC 



32 



Dynamic Assessment 
33 

strongly associated with gain scores than the learning index. 
Vhis is consistent with the findings ^"'.om the earlier series 
where ability group differences were larger on transfer than 
during learning. 

The studies reviewed thus far establish that the dynamic 
assessment measures do provide diagnostic information about 
children and can play a role in the identification component of 
diagnosis. But what about the prescriptive component, i,e., do 
they suggest any particular sources of problems to which 
instructional programs might be geared? The answer is, "Yes, to 
some extent." Throughout the series of studies, the largest and 
most consistent effects have had to do with aspects of transfer 
propensity. Transfer measures were most strongly related to 
ability measures — it is transfer that best discriminates various 
ability groups. And transfer flexibility is also the best 
predictor of gain scores. Our best overall description of 
differences between more and less successful students would then 
be in terras of the processes underlying the judicious application 
of acquired skills to the solution of novel problems. The 
suggestion is that any programs that are designed for use with 
academically weak students must deal with the transfer issue. It 
is not sufficient to plan instruction in such a way that rules 
and principles are learned to some criterion; it is also 
necessary to attempt to provide these tools in a context that 
stimulates students* ability to use them with some flexibility. 
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Although this is a general suggestion, and one that has been 
around for decades, i.e., ceachers are frequently told to "teach 
for transfer" even if not taught how to do so, what makes it more 
than a platitude is the fact that some of the instructional 
principles that are effective at inducing transfer have been 
identified (e.g.. Brown & Campione, 1978, 1981; Palincsar & 
Brown, 1984) and shown to be effective. These include training 
in multiple settings, attention to the metacognitive environment 
of instruction — making the student aware of the skills being 
taught and of the need to actively monitor and regulate them, and 
the range of applicability of those skills— teaching the skills 
in the actual context in which they are to be used, rather than 
as isolated skills, etc. Further, many programs designed for 
weak students intentionally ^ not include such components; the 
idea is that for such students instruction should concentrate on 
making sure they "know the basic tacts," a form of mastery 
learning that leads to a concentration on drill aimed at 
perfecting individual skills, quite the ooposite of the 
conclusions we have reached. To buttress this argument, in 
Section D, we review a program of research that has embodies 
these features and that has produced impressive results. That 
work has included the general suggestions mentioned here along 
with more specific suggestions that followed, once the particular 
domain in question had been specified. Before turning to the 
instructional work, however, we would like to indicate the ways 
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in which we are attempting to improve the basic assessment 
procedures. 

Current and Future Is Fues Regarding Assessment 

The initia] results obtained in our adaptation of Vygotsky's 
zone of proximal development approach to absessment have been 
encouraging. Over a series of studies involving different tasks 
and subjects of widely varying ages and abilities, the learning 
and transfer metrics have consistently provided useful 
information about students. The'jare related to ability measures 
(which are themselves predictive of academic success), but also 
provide additional information not captured by those tests. The ^ 
have fiso consistently led to an emphasis on transfer processes 
as sources of individual differences, and hence to suggestions 
about the design of intervention programs aimed at weak students. 
Our ongoing studies include attempts to improve on the diagnostic 
properties of the dynamic measures, in terms of both the 
identification and prescriptive goals. We are also extending the 
procedures to richer, more academically relevant tasks for both 
practical and theoretical reasons. 

The role of personality factors . One line of research 
attempts to improve on the predictive power of dynamic measures 
by adding information about individuals* attribution styles. The 
assessment is carried out in a social, interactional system where 
en expert and a novice work together to solve sets of problems. 
In this situation, students ask for, and are given, help as 
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needed. It is unlikely that t^eir responses in such a social 
situation are determined by purely "cognitive" factors. On a 
general level, there appeared to be clear differences between the 
ways young children (five-year-olds in Bryant's work) and the 
elderly (French, 1979) responded to the hinting procedures. The 
elderly appeared threatened by the need for hints and interpreted 
them as indicating that they were failing on the task; in 
contrast, the young c ildren appeared more willing to accept the 
help and still feel that they had solved the problems themselves. 
There is also some evidence that different children interpret the 
input in different ways, some seeking it frequently and others 
doing everything they can to avoid asking for help so they can in 
fact be allowed to solve the problems themselves. This, along 
with some of their spontaneous verbal comments, indicates that 
children adopt either learning or performance goals (Dweck & 
Elliot, 1983; Dweck & Bempechat, 1984) in the task that lead them 
to react differently to the need for aid. If we could assess 
those orientations, we should oe in a better position to evaluate 
their performance during the assessment sessions. To collect 
some relevant data, we have re-done the Campione et al. (in 
press) matrices study with a l^rge group of fourth graders. We 
have also administered these students a pair of social comparison 
questionnaires — Crandall, Katkovsky and Crandall's (1965) 
Intellectual Achievement Responsibility Scale and Harter's (1983) 
Perceived Competence Scale: Revised Version. The hypothesis is 
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that by taking into account individual's attributions and 
orientations, more accurate predictions about future performance 
can be obtained. 

Qualitative analyses . We are also attempting to generate 
richer descriptions of individual students and more detailed and 
prescriptive pictures of the differences between sue issful and 
unsuccessful students. We would like to be able to get rich 
qualitative descriptions of students' approaches to the problems 
while maintaining the standardized format that has produced 
useful quantiative data, i.e., to merge the psychometric and 
clinical approaches . 

One approach that is particularly promising involves having 
students "talk-aloud" about their approaches as they work on the 
problems. The initial attempt here has shown that fourth graders 
can handle this requirement quite well, and it appears that 
adding this component does not materially change the ways in 
which they approach the problems. (It is also the case that some 
five-year-olds provide spontaneous talk-alouds during the testing 
sessions; the social interactional nature of the assessment 
process seems to support this nicely. When these talk-alouds do 
occur, they are quite informative about individual children's 
approaches and supplement the quantitative data in interesting 
ways.) Although these data are not fully analyzed, these talk- 
alouds do appear to provide useful information. For example, 
successful students tend to spend a considerable amount of time 
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"plaanirg** their moves; they talk about what the answer should 
look like before they begin to construct their own answer or 
bef' e the consider the alternatives from which they have to 
select. Less successful subjects tend to begin their 
construction activities without fully analyzing the problem; they 
tend to misclas.<;if y problems and proceed in an unvsystematic 
fashion to construct an answer. There are also differences in 
the ways in which successful and unsuccessful students recover 
from errors, or from false starts that do not lead to problem 
solution. We are confident that this information could be used 
to design more powerful and individually tailored programs of 
instruction. 

Extension to academic domains . Finally, having shown that 
our procedures do work, we are extending them to richer and 
academically more interesting domains — initially early 
mathematics. There are several reasons for doing this. 

First, pragmatically, this domain is one of clear 
educational significance, and given that the procedures we have 
used require a large amount of effort to develop, it makes sense 
to situate that work in such an area. Assessments of individual 
readiness are of more immediate interest if obtained in math than 
in inductive reasoning domains; and the leap from diagnosis to 
suggestions for the design of instructional programs is shorter 
in the case of math. 
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Second, of more theoretical interest, theie have been a 
number of detailed analyses of the structure and development of 
mathematical knowledge. This work makes it possible to obtain a 
reasonably thorough picture of students' mathematical knowledge 
before they enter the assessment situation. This is important 
because it is only if we can assess the quality of an 
individual's knowledge In some area that we can clearly evaluate 
the differential contributions of content knowledge and 
learning/transf er dynamics to the assessment process. For 
example, some might argue that the differences in learning and 
transfer efficiency which are uncovered in our studies are 
actually no mora than manifestations of individual differences in 
content knowledge. Unless we have a good measure of that 
knowledge, it is difficult to refute that claim. In some current 
work, Ferrara, as part of her dissertation research, is working 
on the development of a test of earl math knowledge. She is 
also designing hinting procedures that can be used with simple 
addition and subtraction problems. With these assessments of 
knowledge and learning/transf er efficiency in hand, it will be 
possible to assess the predictive properties of tha c'ynamic 
measures when students are equated for their entering knowledge. 

This leads to the last point. We are interested in devising 
measures that can predict students* furr.re trajector-' es . The 
success of such an enterprise can best be evaluated in an area 
where there is room for a large amount of improvement; inductive 
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reasoning problems of the type we began this research with do not 
serve this purpose well. In contrast, mathematics, even early 
mathematics, is an area where there is considerable room for 
improvement; as such, we can track the progress of students over 
long periods of time while they are acquiring increasingly 
sophisticated sets of skills. In this way, stronger tests about 
the utility of dynamic assessment procedures can be arranged. 
Instructional Design : The Dynamic Nature of Assessmen t 

In this section, we describe the highlights of a program of 
research that has concentrated more directly on instruction, and 
instruction in a particular academic domain. The concentration 
on a specific domain makes it somewhat easier to specify in more 
detail the skills distinguishing strong from weak students. The 
goal was to improve the reading and listening comprehension 
skills of students experiencing particular difficulties with that 
task. In the main studies (Brown & Palincsar, 1982, in press; 
Palincsar & Brown, 1984), the students were seventh grade 
students of relatively low overall ability (IQs ranged from 
60-100, "h a mean of around 80) whose reading comprehension 
scores lagged one to four years behind those of their age- and 
grade-mates . 

The general design of the instructional sessions was based 
on the same Wgotskian principles that guided the development of 
the assessment procedures. We sought to mirror in the teaching 
situation the gradual transfer of control of cognitive skills 
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that Vygotsky described. The teacher and students would begin by 
working together, with the teacher initially doing most of the 
work. As the children began to acquire the target skills, they 
were encouraged to take on more and more responsibility until 
they were eventually able to employ the skills when working 
independently. 

More specific features of the instruction was based on a 
considerable amount of prior research indica^-ing that one majc- 
difference between skilled and unskilled comprehenders lay in the 
kinds of active comprehension strategies (both compreV>ension- 
fostering and comprehension-monitoring — see ?al±v ^.p : & Brown, 
1984, for additional description) they brought to the task of 
reading for me^n^ ^, Specifically, good readers, in the course 
of studying a text, tend to: (a) stop and summarize what they 
have read periodically; (b) formulate questions that capture the 
main idea of what they have just read; (c) attempt to clarify any 
inconsistencies that appear; and (d) predict what the author will 
go on to say. The instructional program that Palincsar and Brown 
developed was designed to teach these four strategies. Ou • 
interest here is with only a portion ut the overall program, the 
way in which assessment of student capabilit^.es is integrated 
into the overall framework , 

If one were to consider the students with comprehension 
problems and diagnose their competence in the use of the four 
activities just listed, it would turn out that all would 
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essentially fail the test. There is little evidence of their 
using these activities without explicit instruction. Further, 
when asked to engage in the activities, they do so very poorly. 
We have then a reasonable diagnosis about the sources of their 
problems. The question is, what do we do about it? How strongly 
should that diagnosis affect instruction? And how long should 
that diagnosis be retained? The procedures that Palincsar and 
Brown developed, termed reciprocal teaching, provide some 
insights into these issues. 

We do not have the space here to provide thj details of 
their approach. The main point for our purposes is that in the 
teaching sessions, the teachers engaged in constant on- _ine 
diagnosis and re-diagnosis of each student's current l.vel of 
skill. This was possible because the teac!^ing method forced each 
student to produce the key activities overtly. When the group 
was engaged in reading a text, the teacher and students took 
turns leading a dialogue about the text segment they had just 
read. The leader of that dialogue was required to summarize what 
had just been said and formulate a question about the main point 
of the section. When appropriate, they were also told that they 
should seek to clarify any inconsistencies or confusions that 
arose, and to predict what mi^ht happen next. 

Several features of this interaction are important; they 
were included to maximize the likelihood that transfer of the 
target strategies to an array of academic tasks would result. 
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First, note that the students engage in the target activities in 
the context of actually reading and understanding texts. It is 
also made clear to them what those activities are, why they are 
useful, and where they can be applied. Further, as the students 
carry out the activities, the teacher is able to see how well 
they are executed and diagnose wuat individuals' current 
problems, if any, with the particular skills are. In this way, 
feedback can be provided to each student tailored to particular 
needs at the moment. 

Over time, as student competence increases, the teacher's 
diagnosis changes, and different types of feedback are provided 
requiring more advanced responding from the student. In this 
way, the student is gradually led to master the various 
activities, until eventually an acceptable level of skill is 
reached. The teacher begins by doing a large part of the work 
for the student, but as the diagnosis changes, progressively more 
work from the child is required until the teacher can eventually 
fade out, leaving the student to perform unaided. Our point heie 
is simply that the initial diagnosis (the students do not engage 
in these activities) needs to be constantly updated, so that the 
teacher can respond appropriately to the students' needs at any 
point in time, and thus provide the kind of input necessary to 
move them one step further toward independent competence. 

To see how this works in practice, we can consider a 
classroom teacher interacting with two remedial seventh graders— 
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her interactions are quite different in the case of Charles, who 
izakes a very weaR beginning, and Sara, who has a clear (but 
inadequate) idea of how Lo ask questions concerning texts, 
(Charles, IQ = 70, Reading Comprehension = third grade; Sara, IQ 
= 84, Reading Comprehension = fourth grade.) 

Charles > The group is reading a passage about American 
snakes. Charles has a great deal of difficulty taking his turn 
leading the dialogue, primarily because he doesn't know how to 
formulate an appropriate questior (see Table 4), "What is round 
in the Southeastern snake, also the copperhead, rattlesnakes. 



Insert Table 4 about here 



vipers— they have—I'm net doing this right." The teacher 
responds to his difficulty and tells him the main idea. "Do you 
want to ask somethiia about the pit vipers?" When he still fails 
to ask an adequate question, she prompts, ''Ask a good question 
about the pit vipers that starts with the word why." When he 
still cannot manage it, she models, "Why do they call the snakes 
pit vipers?" After two tries, he copies the teacher's question 
and she provides praise and encouragement. Even imitating a 
fully formed question is difficult for Charles initially. 

Four days later Charles is still having difficulty asking 
questions on a passage about spiders. The teacher models one for 
him, but this time she w^its for him to find the main idea 
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himself and attempt to make up a question, "How do spinner's 
mate spend most of his time sitting?" The teacher responcls, 
"You're very close. The ( lestion would be, 'How does spinner's 
mate spend most of his time?' Now you ask it." And he does. 

Seven days into the procedure, Charles can mske up questions 
with a little help pinpointing main ideas and by the eleventh day 
he takes his turn as teacher with two questions, "What is the 
most interesting of the insect eating plants," and "Where do the 
plants live at?" After fifteen days he produces acceptable 
questions each time it is his turn to lead the dialogue. Charles 
- "Why do scientists come to the South Pole to study?" Teacher - 
"Excellent question— that ' s what the paragraph is all about!" 

Sara. In contrast to Charles, another student in the group, 
Sara (see Table 5) has a clear idea of what kinds of questions occ 
in schools— "fill in the blanks." The tpacher, preoccupied with 
Charles, tolerates such questions until the second day and then 
attempts to take Sara beyond this level. Sara - "Snakes' 



Insert Table 5 about here. 



backbones can have as many as 300 vertebrates - almost 
times as many as humans?" Teacher - "Not a bad beginning, but I 
would consider that a question about a detail. Try to avoid 
'fill in the blanks' questions. See if next time you can find a 
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main idea question and Degin your question with a question word - 
how, why, when • • 

On the third day, Sara comes up with a main idea question, 
but this time she selects a line in the text, "several varieties 
of snakes live all their lives in the sea," and turns it into a 
question, "Can snakes live their whole lives in seas?" The 
teacher again increases her demand and asks, "See if you can ask 
a question using your own words." For the remainder of the 
sessions, Sara composes questions in her own words becoming more 
and more like the model teacher in her turn. 

The teacher's responses to Charles and Sara are different, 
and this variation appears to dovetail well with their entering 
skill levels and rentes of improvement. As the teacher diagnoses 
their growing levels of competence, she asks more and more of 
them until they eventually generate good questions with no 
teacher guidance. Notice that they are never ask-^d to make a 
large jump, never asked to move quickly to unaided performance. 
Rather, they are gradually guided to that level, something that 
can occur only if the teacher continues to update her assessment 
of their evolving capabilities. 

This program has produced impressive results in a number of 
replications ranging from experimental studies involving small 
groups throuph larger-scale studies involving classroom 
instruction conducted by teachers with their regular, and 
frequently large, reading groups. We will highlight some of the 
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major gains here (see Palincsar & ^rown, 1984, for more detail), 
(a) Throughout the period during which instruction was provided, 
students took daily tests on their ability to read a science or 
social studies passage and then ansv^er from memory ten 
comprhension questions. Instructed students' performance on 
these tests begins at around 30-40% correct and improves steadily 
until they are consiste.^tly scoring 80% correct, (b) There is 
also evidence that their newfound skills are being transferred to 
classroom activities. For example, in one study, all seventh 
graders in the school (approximately 140) took regular exams, 
consisting of reading passages and answering comprehension 
questions, in their science and social studies classes. At the 
beginning of the intervention, the students in the reciprocal 
teaching ^rroups scored at around the 15th percentile; by the end 
they nad moved up to above the bOth percentile. (c) They shower^ 
evidence ot transferring some of the trained skills to 
laboratory-based tests. There were significant increases in 
their ability to detect text inconsistencies, generate quections 
probing the main idea of the passages they read, and write 
summaries of portions of assigned texts. And (d) their 
standardized reading comprehension scores increased 
significantly — by an average of just over two years. 

Although none of the students showed evidence of using the 
target activities spontaneously at the outset of the studies, and 
some had extreme difficulty producing them when initially 
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instructed to do so, the teacher was able to monitor the 
improvement that did occur and provide the kind of practice and 
feedback needed to continue tnat improvement. As a result, the 
students did learn to use the skills indc jendently and flexibly, 
leading to worthwhile improvements in their ability to read and 
understand texts • 

Summary 

We have reviewed several lines of research incorporating 
features of dynamic assessment. In that research we have used 
dynamic assessment to refer to two distiact sets of artivities, 
one emphasizing the -'iew that assessment attempts should be aimed 
as directly as posiiible at the processes undarlying successful 
performance on academic tasks, and the second that the assessment 
itself should be continuously updated. Our stud^ss conducted 
thus far have shown that the measures of learning and transfer 
efficiency that we generate in our adaptation of Vygot sky's zone 
of proximal development tr^isting procedures do possess both 
concurrent and predictive validity. They have also indicated 
that the best predictors of the extent to which individuals are 
likely to profit from instruction are their initial responses to 
instruction and, even more sensitive, the extent to which they 
can transfer their newly learned skills to novel situations. 

In the context of instruction, we have argued that whereas 
early diagnoses can provide important information about: the kinds 
of educational programs needed with weak students, those 
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diagnoses need to be continuously updated if they are to 
contribute meaningfully to ins*-ructional goals. Diagnosis should 
not be used to pigeonhole students, but rather to provide 
information indicating how instruction should change over time. 
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Table 1 



Examples of Learning, flaintenance , and Transfer Items 



a 



Problem type Pattern Sanple Prot lem Correct Answer 

Original Learning m NGOHPIQJ (RKSL) 

NINI PZUFQZVF (RZV/F) 

Maintenancj (Learned pattern types; new instantiations) 

Near Transfer (Learned relations and periodicities, but in new 
combinations) 

m DVEVFVGV (HVIV) 

NNNN VHDPWIEQ (XJFR) 

Far Transfer (New relation^ backward-next; or new periodicity, 
three letters) 

BN UCTDSERF (QGPH) 

NBNI JPBXKOCX (LNDX) 

NIN PADQAERA (FSAG) 

Verjr Far Transfer (Backward-next as well as next relations and 

"period" of two letters, but relations must be 
sought between strings of letters rather than 
within a string) 

Instructions ; 

Pretend that you are a spy. You want to send the 
message on top in a secret cod'^ that only your friends will 
understand. Someone has begun coding the message for you on 
the second line. Tr> Ic figure out the secret code and 
finish coding the message by filling in thp oJanks with the 
letters that follow the code. 

SIX SHIPS GONE 

THY R I H Q R (H fJ 0 D) 

a 

The letters themselves in the pattern notations refer to the 
alphabetic relations (i.e., M next, I = identity, B = backward- 
next). The number of letters in each pattern nota*- .on equals the 
period . 
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Table 2 



Multiple Regression Summary Table for Matrices Task 



D-ipendent 


Independent 


Correlation 




2 


Varibule 


Variable 


r 


Multiple R 


Increment in R 


Training 


Information 




.439 


.193* 




Coding 




.587 


.152» 


Transfer 


Information 


-.389* 


.239 


.151» 


Residual 


Estimated IQ 


.485* 


.485 


.235* 


gain 


Ravens 




.608 


.135* 




Training 


-.605* 


.770 


.224» 




Transfer 


-.595» 


.876 


.173* 




Far Transfer 


-.698* 


.884 


.014 
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Table 3 

Multiple Regression Summary Table for Series Completion Task 



Dependent Independent Correlation 2 
Variable Variable r Multiple R Increment in R 



Training Block Design -.476* .1^76 .227* 

Vocabulary -.427* .677 . 153* 



Transfer Block Design -.581* .581 .338* 

Animal House -.479 .641 .073 



Residual Estimated IQ .521* .521 .272* 

gain Ravens .352 .578 .062 

Training -.461* .595 .020 

Transfer -.693* .7^5 .221* 

Far Transfer -.558* .7^5 .000 
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The Acquisition of Question-Asking by a Weak Seventh Grade Student (Charles) 
Day 1: 

TEXT: The water noccasin, somewhat longer tha.i the coppernead, is 

found in the southeastern states. It lives in owampy regions- 
It belongs, as do also the copperhead and the rattlesnakes, to 
a group of poisonous snakes called pit vipers. They have pits 
between their eyes and their nostrils which, because they are 
sensitive to heat, help the snakes tell when th^^y are near a 
warm-blooded animal. Another name for the water moccasin is 
"cottonmouth. " This name comes from the white lining of the 
snake's mouth. 

1) S; What is found in the southeastern snakes, also the copperhead, 

rattlesnakes, vipers - they have. I'm not doing this right. 

2) T: All right. Do you want to know about the pit vipers? 

3) S: Yeah. 

^) T: What would be a good question about the pit vipers that starts with 
the word "why?" 

5) S: (No response) 

6) T: How about, "Why are the snakes called pit vipers?" 

7) S: Wiy do they want to know that they are called pit vipers? 

8) T: Try it again. 

9) S: Why do they, pit vipers in a pit? 

10) T? How about, "V/hy do they call the snakes pit vipers?" 

11) S: V/hy do they call the snakes pit vipers? 

12) There you go! Good for you. 

Day 14: 

TEXT: Spinner's mate is much smaller than she, ard his body is dull 
brown. He spends most of his time setting at one side of her 
web. 

13) S: (No question) 

1^) T: What's this paragraph about? 

15) S: Spinner's mate. How do spinner's mate... 

16) T: That's good. Keep going. 

'7) S: How do spinner's mate is smaller than... How am I going tc say that? 

18) T: Take your time with it. You want to ask a question about spinner's 

mate and what he does, beginning with the word "how." 

19) S: How do they spend most of his time sitting? 

20) T: You're very close. The question would be, "How does spinner's nate 

spend most of his time?" Now, you ask it, 

21) S: How does spinner's mate spend most of his time? 
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Table U (cont'd.) 
Day 7: 

7EXT: Perhaps you are wondering ^here the lava and othQr volcanic 
products come from. Deep within our earth there are pockets 
of molten rock called magma. Forced upward in part by gas 
pressure, this molten rock co inually tries to reach the 
surface. Eventually— by means of cracks in the crustal rocks 
or some similar zone of weakness—the magma may break out of 
the ground. It then flows from the vent as l«vcs or spews 
skyward as dense clouas of lava particles. 

22) S: How does the pressure from below push the mass of hot rock against the 

opening? Is that it? 

23) T: Not quite. Start your question with, "What happens when?" 

24) S: What happens when the pressure from below pushes the mass of hot rock 

against the opening? 

25) T: Good for you! Good job. 

Day 11: 

TEXT: One of the most interesting of the insect-eating plants is the 
ye",us's flytrap. T^is plant lives in only one small area of 
the world — the coastal marshes of North and South Carolina. 
The Venus's flytrap doesn't look unusual. Its habits, 
however, make it truly a plant wonder. 

26) S: V;hat is the most interesting of the insect eating plants, and where do 

the plants live at? 

27) T: Two excellent questions! They ar*^ both clear and important questions. 

Ask us one at a time now. 

Day 15: 

TEXT: Scientists also come to the South Pole to study the strange 

lights- that glow overhead during the Antarctic night. (It's a 
cold and lonely world for the few liardy peopl<^ who "winter 
over" the polar night.) These "southern lights" are caused by 
the Earth acting like a magnet on electrical particles in the 
air . They are clues that may help us understand the Earth's 
core and the upper edges of its blanket of air. 

28) S: Why do scientists come to the south pole to study? 

29) T: Excellent question! That is what this paragraph is all about. 
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Table 5 

Improvement in Question-Asking by a More Competent Seventh Grade Student (Sara) 
Day 2: 



TEXT: HOW CAN SNAKES BE SO FLEXIBLE? 



The snake's skeleton and parts of its body are very flexible- 
something like a rubber hose with bones. A snake's backbone 
ran have as many as 300 vertebrae, almont ten times as many as 
c human's. These vertebrae are connected by loose and rubbery 
tissues that allow easy movement. Because of this bendable, 
twistable spinal construction, a snake can turn its body in 
almost any direction at almost any point. 

Snakes' backbones can have as many as 300 vertebrates - almost 
times as many as humans. 

Not a bad beginning, but I would consider that a question about a 
detail. Try to avoid "fill in the blank" questions. See if next time 
y u can fine a main idea question and begin your question with a 
question word - how, why, when.... 

Day 3: 



1) S: 

2) T: 



TEXT: There are snakes in nearly all parts of the world. Some 

snakes prefer warm, arid desert areas. Others prefer leafy 
forests, fields, and woodlands. Some stay in areas near water 
and are fine swimmers. Then there are several varieties that 
live all their lives in the sea. 

3) S: Can snakes live their whole lives in seas? 

^) T: See if you can ask a question using your own words. 

Day ^: 



TEXT: The other kind of camel— the one with two humps—is the 

Bactrian. Its home country is the Gobi Desert of northeastern 
Asia. The Bactrial has shorter legs and longer wool than the 
one-humped camel. It also has stronger, more rugged feet. 
This is important because instead of having sand to walk on, 
the .actrian camels live in rough and rocky parts of the 
world , 



5) S: V/here is the Bactrian found? 

6) T: Good for you. 



Table 5 (cont'd.) 



Day 6: 

TEXT: When most full-grown spiders want to travel, they have to walk 
on their eight legs. But some small kinds of spiders, and 
many young ones, use an easier way. They clinb up on bushes, 
fence posts, or weed stems and spin streamers of silk. When 
the wind catches the silk and blows it away, each spider 
tightly holds onto his own streamer. The silk streamer 
carries him through the air as if it were a parachute or a 
balloon. 

7) S: I think I have another. When it's traveling, what do they comprre the 

spider to? 

8) T: An interesting question. 

Day 11: 

TEXT: The young caterpillar's first meal is its own eggshell. Then 
it eats a leaf and each day eats more and more food. After a 
few days, the caterpillar becomes too large for its skin. A 
new skin forms beneath the first one, the old skin comes open 
and, like a snake, the caterpillar wriggles its way out of the 
split skin. Then the caterpillar goes on eating leaves or 
other kinds of food. When the rew skin becomes too tight for 
the growing body, it again splits and comes off. By then the 
caterpillar is covered by another skin. This eating and 
shedding goes on for several weeks. The old skin may be 
replaced by a new one four or five times. Each time the skin 
is shed, the size and color of the caterpillar change. 

9) S: Why does the caterpillar's skin split? 

10) T: Excellent question. That was the point of the entire pdragrapn. 
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Figure Captions 

£igur_e K Examples of the learning problems used in the matrices 
study. The top panel contains a rotdtion problem, the middle 
panel an inposition problem, and the bottom panel a subtraction 
problem. 

Figure _2. Examples of the transfer problems used in the matrices 
study. The top panel contians a rotation plus subtraction 
problem, the bottom panel a rotation plus imposition problem. 
Figure 3. A sample hint sequence for rotation problem. 
Figure 4^. The mean number of prompts required on the transfer 
problems as a function of ability and transfer distance. 



HINT 1: 



HINT 2: 



'wHY'll'mGH? il r^M pS rul^r'-'^T ^HINK ABOUT 

■run n MIGHT BE CALLED THAT.. . Do YOU KNOW HOW TO Sni VF 
THE PROBLEM NOW OR DC YOU WANT ANOTHER HINT? 

HIS IS ROW 1. Put picture 1 in the practice box TmirM 

N. Touch the picture. Now try to make ti e picture 

LOOK LIKE THE SECOND PICTURE. (If SUCCESSFUL) Yminin 
IT. Now MAKE IT LOOK LIKF THE LAST PICTURE " ( I F rSn n 
CANNOT MAKE PICTURE 3. PUTO W I l[ G I VE H^NT '2a . 



HINT 2a: 



'Tn^Mc:^^ 5?11'a ^^^^ PICTURE 1. WaTCH HOW IT 

TURNS. Watch Again. Now you do it," (If child 
give^HintIS!)^^^ above demonstration. PLATO will 



HINT 2b: 



"This is row 1. Let's try to make the last pitturf 

in,jM'^°^' f^'^ '"''"^^^ 1 THE JraCT? E B^x! 

I OUCH ys . Touch ys again. Good. You have 

?^lE^.{s^I,^f ^Ic^jSL^^^ ^° ^^^^ 



HINT 3: 



Now LET S LOOK AT ROW 2. PuT PICTURE 1 OF ROW 2 I^i THF 
PRACTICE BOX. NoW MAKE IT LOOK LIKE PICTURE 2 IF rMTin 
DOES NOT RESPOND CORRECTLY PLATO WILL DISPLAY TOUCH ? 
You DID IT. Now MAKE THE P I CTURE N THE PRACT CE Bof^ nn^ 
L KE THE LAST PICTURE IN ROW 2. No'l Iry ThE PRoLeS AGA?n 

(If child cannot make picture 3 PLATO will give Hin? 3a 



HINT 3a: 



"Touch 



Touch ✓% 



AGA i N . 



HINT t\i 



I AMn 9 THE TURNING RULE TO MAKE THE LAST PICTURE IN ROWS 
nc;F^?uc'ci^^ "-^ST PICTURE IN ROW 3 IS MISSING. TrY TO 
USE THE SAME RULE TO MAKE THE MISSING PICTURE IN ROW \ " 
(If CHILD CANNOT DO SO. PLATO WILL GIVE HiNT 4a ) 



HINT 4a: 



"This is the shape you work with. (PLATO displays 

f PROPRIATE SHAPE.) PuT IT IN THE PRACTICE BOX 

Touch THE first picture in row 3 nSS iiScn A . 

Now TOUCH yN AGAIN. ThAT IS CORRECT. ToucH DONE. 



[ooI^M^IlI! !hrff'powI' ^V^' ^^^f^^^^^G PROBLEM): "GoOD. 

vnn ii?rn Tuc t^'^^^ ^HE TURNING RULE IS USED IN EACH ROW Amh 

P?cTUReVto GFT^?rTnPF''^'° f ^^^^^^^ PI CTiRE^'^YoS^TURNES 

pi;ture 3. PICTURE c. Then you turned picture 2 to get 



iF?L^'iE^^N''riwo''pR'o^\EM^?N'ri;ir^^ R07AT ION PROBLEMS 

PLATO Wia MOVE A^T^n -° TMF r oc? ?2n ^^"^"^^T ANY HINTS. ThEN 
ILL nuvb AHEAD .C THE FIRST IMPOSITION PROBLEM. 



